Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 858 |
| Missing cells | 3622 |
| Missing cells (%) | 11.7% |
| Duplicate rows | 20 |
| Duplicate rows (%) | 2.3% |
| Total size in memory | 241.4 KiB |
| Average record size in memory | 288.2 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 26 |
STDs:cervical condylomatosis has constant value "0.0" | Constant |
STDs:AIDS has constant value "0.0" | Constant |
| Dataset has 20 (2.3%) duplicate rows | Duplicates |
Age is highly overall correlated with Num of pregnancies and 1 other fields | High correlation |
Biopsy is highly overall correlated with Hinselmann and 1 other fields | High correlation |
Dx is highly overall correlated with Dx:CIN and 4 other fields | High correlation |
Dx:CIN is highly overall correlated with Dx and 2 other fields | High correlation |
Dx:Cancer is highly overall correlated with Dx and 1 other fields | High correlation |
Dx:HPV is highly overall correlated with Dx and 1 other fields | High correlation |
Hinselmann is highly overall correlated with Biopsy and 1 other fields | High correlation |
IUD is highly overall correlated with IUD (years) | High correlation |
IUD (years) is highly overall correlated with IUD | High correlation |
Num of pregnancies is highly overall correlated with Age | High correlation |
STDs is highly overall correlated with STDs (number) and 5 other fields | High correlation |
STDs (number) is highly overall correlated with STDs and 6 other fields | High correlation |
STDs: Number of diagnosis is highly overall correlated with STDs and 4 other fields | High correlation |
STDs: Time since first diagnosis is highly overall correlated with Dx and 3 other fields | High correlation |
STDs: Time since last diagnosis is highly overall correlated with Age and 4 other fields | High correlation |
STDs:HIV is highly overall correlated with STDs (number) and 1 other fields | High correlation |
STDs:condylomatosis is highly overall correlated with STDs and 3 other fields | High correlation |
STDs:syphilis is highly overall correlated with STDs (number) | High correlation |
STDs:vaginal condylomatosis is highly overall correlated with STDs (number) | High correlation |
STDs:vulvo-perineal condylomatosis is highly overall correlated with STDs and 3 other fields | High correlation |
Schiller is highly overall correlated with Biopsy and 1 other fields | High correlation |
Smokes is highly overall correlated with Smokes (years) | High correlation |
Smokes (packs/year) is highly overall correlated with Smokes (years) | High correlation |
Smokes (years) is highly overall correlated with Smokes and 1 other fields | High correlation |
STDs is highly imbalanced (51.6%) | Imbalance |
STDs (number) is highly imbalanced (72.7%) | Imbalance |
STDs:condylomatosis is highly imbalanced (67.9%) | Imbalance |
STDs:vaginal condylomatosis is highly imbalanced (95.2%) | Imbalance |
STDs:vulvo-perineal condylomatosis is highly imbalanced (68.4%) | Imbalance |
STDs:syphilis is highly imbalanced (83.7%) | Imbalance |
STDs:pelvic inflammatory disease is highly imbalanced (98.5%) | Imbalance |
STDs:genital herpes is highly imbalanced (98.5%) | Imbalance |
STDs:molluscum contagiosum is highly imbalanced (98.5%) | Imbalance |
STDs:HIV is highly imbalanced (83.7%) | Imbalance |
STDs:Hepatitis B is highly imbalanced (98.5%) | Imbalance |
STDs:HPV is highly imbalanced (97.3%) | Imbalance |
STDs: Number of diagnosis is highly imbalanced (78.2%) | Imbalance |
Dx:Cancer is highly imbalanced (85.3%) | Imbalance |
Dx:CIN is highly imbalanced (91.6%) | Imbalance |
Dx:HPV is highly imbalanced (85.3%) | Imbalance |
Dx is highly imbalanced (81.6%) | Imbalance |
Hinselmann is highly imbalanced (75.4%) | Imbalance |
Schiller is highly imbalanced (57.6%) | Imbalance |
Citology is highly imbalanced (70.8%) | Imbalance |
Biopsy is highly imbalanced (65.6%) | Imbalance |
Number of sexual partners has 26 (3.0%) missing values | Missing |
Num of pregnancies has 56 (6.5%) missing values | Missing |
Smokes has 13 (1.5%) missing values | Missing |
Smokes (years) has 13 (1.5%) missing values | Missing |
Smokes (packs/year) has 13 (1.5%) missing values | Missing |
Hormonal Contraceptives has 108 (12.6%) missing values | Missing |
Hormonal Contraceptives (years) has 108 (12.6%) missing values | Missing |
IUD has 117 (13.6%) missing values | Missing |
IUD (years) has 117 (13.6%) missing values | Missing |
STDs has 105 (12.2%) missing values | Missing |
STDs (number) has 105 (12.2%) missing values | Missing |
STDs:condylomatosis has 105 (12.2%) missing values | Missing |
STDs:cervical condylomatosis has 105 (12.2%) missing values | Missing |
STDs:vaginal condylomatosis has 105 (12.2%) missing values | Missing |
STDs:vulvo-perineal condylomatosis has 105 (12.2%) missing values | Missing |
STDs:syphilis has 105 (12.2%) missing values | Missing |
STDs:pelvic inflammatory disease has 105 (12.2%) missing values | Missing |
STDs:genital herpes has 105 (12.2%) missing values | Missing |
STDs:molluscum contagiosum has 105 (12.2%) missing values | Missing |
STDs:AIDS has 105 (12.2%) missing values | Missing |
STDs:HIV has 105 (12.2%) missing values | Missing |
STDs:Hepatitis B has 105 (12.2%) missing values | Missing |
STDs:HPV has 105 (12.2%) missing values | Missing |
STDs: Time since first diagnosis has 787 (91.7%) missing values | Missing |
STDs: Time since last diagnosis has 787 (91.7%) missing values | Missing |
Num of pregnancies has 16 (1.9%) zeros | Zeros |
Smokes (years) has 722 (84.1%) zeros | Zeros |
Smokes (packs/year) has 722 (84.1%) zeros | Zeros |
Hormonal Contraceptives (years) has 269 (31.4%) zeros | Zeros |
IUD (years) has 658 (76.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-07-19 01:28:43.257102 |
|---|---|
| Analysis finished | 2024-07-19 01:28:58.924085 |
| Duration | 15.67 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
Age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 44 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.820513 |
| Minimum | 13 |
|---|---|
| Maximum | 84 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 20 |
| median | 25 |
| Q3 | 32 |
| 95-th percentile | 41 |
| Maximum | 84 |
| Range | 71 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 8.4979481 |
|---|---|
| Coefficient of variation (CV) | 0.3168451 |
| Kurtosis | 4.7785751 |
| Mean | 26.820513 |
| Median Absolute Deviation (MAD) | 5.5 |
| Skewness | 1.3942788 |
| Sum | 23012 |
| Variance | 72.215121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 54 | 6.3% |
| 18 | 50 | 5.8% |
| 21 | 46 | 5.4% |
| 20 | 45 | 5.2% |
| 19 | 44 | 5.1% |
| 24 | 39 | 4.5% |
| 25 | 39 | 4.5% |
| 26 | 38 | 4.4% |
| 28 | 37 | 4.3% |
| 17 | 35 | 4.1% |
| Other values (34) | 431 |
| Value | Count | Frequency (%) |
| 13 | 1 | 0.1% |
| 14 | 5 | 0.6% |
| 15 | 21 | |
| 16 | 23 | |
| 17 | 35 | |
| 18 | 50 | |
| 19 | 44 | |
| 20 | 45 | |
| 21 | 46 | |
| 22 | 30 |
| Value | Count | Frequency (%) |
| 84 | 1 | |
| 79 | 1 | |
| 70 | 2 | |
| 59 | 1 | |
| 52 | 2 | |
| 51 | 1 | |
| 50 | 1 | |
| 49 | 2 | |
| 48 | 2 | |
| 47 | 1 |
Number of sexual partners
Real number (ℝ)
MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 26 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5276442 |
| Minimum | 1 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.6677605 |
|---|---|
| Coefficient of variation (CV) | 0.65980823 |
| Kurtosis | 69.204754 |
| Mean | 2.5276442 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 5.4546486 |
| Sum | 2103 |
| Variance | 2.781425 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 272 | |
| 3 | 208 | |
| 1 | 206 | |
| 4 | 78 | 9.1% |
| 5 | 44 | 5.1% |
| 6 | 9 | 1.0% |
| 7 | 7 | 0.8% |
| 8 | 4 | 0.5% |
| 15 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| Other values (2) | 2 | 0.2% |
| (Missing) | 26 | 3.0% |
| Value | Count | Frequency (%) |
| 1 | 206 | |
| 2 | 272 | |
| 3 | 208 | |
| 4 | 78 | 9.1% |
| 5 | 44 | 5.1% |
| 6 | 9 | 1.0% |
| 7 | 7 | 0.8% |
| 8 | 4 | 0.5% |
| 9 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 28 | 1 | 0.1% |
| 15 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| 9 | 1 | 0.1% |
| 8 | 4 | 0.5% |
| 7 | 7 | 0.8% |
| 6 | 9 | 1.0% |
| 5 | 44 | 5.1% |
| 4 | 78 | 9.1% |
| 3 | 208 |
First sexual intercourse
Real number (ℝ)
| Distinct | 21 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 7 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.9953 |
| Minimum | 10 |
|---|---|
| Maximum | 32 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 15 |
| median | 17 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 32 |
| Range | 22 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.8033554 |
|---|---|
| Coefficient of variation (CV) | 0.16494886 |
| Kurtosis | 4.28836 |
| Mean | 16.9953 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.5643746 |
| Sum | 14463 |
| Variance | 7.8588014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 163 | |
| 17 | 151 | |
| 18 | 137 | |
| 16 | 121 | |
| 14 | 79 | |
| 19 | 60 | 7.0% |
| 20 | 37 | 4.3% |
| 13 | 25 | 2.9% |
| 21 | 20 | 2.3% |
| 23 | 9 | 1.0% |
| Other values (11) | 49 | 5.7% |
| Value | Count | Frequency (%) |
| 10 | 2 | 0.2% |
| 11 | 2 | 0.2% |
| 12 | 6 | 0.7% |
| 13 | 25 | 2.9% |
| 14 | 79 | |
| 15 | 163 | |
| 16 | 121 | |
| 17 | 151 | |
| 18 | 137 | |
| 19 | 60 | 7.0% |
| Value | Count | Frequency (%) |
| 32 | 1 | 0.1% |
| 29 | 5 | 0.6% |
| 28 | 3 | 0.3% |
| 27 | 6 | 0.7% |
| 26 | 7 | 0.8% |
| 25 | 2 | 0.2% |
| 24 | 6 | 0.7% |
| 23 | 9 | |
| 22 | 9 | |
| 21 | 20 |
Num of pregnancies
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 56 |
| Missing (%) | 6.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2755611 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 16 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4474141 |
|---|---|
| Coefficient of variation (CV) | 0.63606909 |
| Kurtosis | 3.2133661 |
| Mean | 2.2755611 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.4235139 |
| Sum | 1825 |
| Variance | 2.0950075 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 270 | |
| 2 | 240 | |
| 3 | 139 | |
| 4 | 74 | 8.6% |
| 5 | 35 | 4.1% |
| 6 | 18 | 2.1% |
| 0 | 16 | 1.9% |
| 7 | 6 | 0.7% |
| 8 | 2 | 0.2% |
| 11 | 1 | 0.1% |
| (Missing) | 56 | 6.5% |
| Value | Count | Frequency (%) |
| 0 | 16 | 1.9% |
| 1 | 270 | |
| 2 | 240 | |
| 3 | 139 | |
| 4 | 74 | 8.6% |
| 5 | 35 | 4.1% |
| 6 | 18 | 2.1% |
| 7 | 6 | 0.7% |
| 8 | 2 | 0.2% |
| 10 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| 8 | 2 | 0.2% |
| 7 | 6 | 0.7% |
| 6 | 18 | 2.1% |
| 5 | 35 | 4.1% |
| 4 | 74 | 8.6% |
| 3 | 139 | |
| 2 | 240 | |
| 1 | 270 |
Smokes
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 13 |
| Missing (%) | 1.5% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2535 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 722 | |
| 1.0 | 123 | 14.3% |
| (Missing) | 13 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 722 | |
| 1.0 | 123 | 14.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| . | 845 | |
| 1 | 123 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2535 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| . | 845 | |
| 1 | 123 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2535 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| . | 845 | |
| 1 | 123 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2535 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| . | 845 | |
| 1 | 123 | 4.9% |
Smokes (years)
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 30 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 13 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2197214 |
| Minimum | 0 |
|---|---|
| Maximum | 37 |
| Zeros | 722 |
| Zeros (%) | 84.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 9.8 |
| Maximum | 37 |
| Range | 37 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.0890169 |
|---|---|
| Coefficient of variation (CV) | 3.3524188 |
| Kurtosis | 23.768418 |
| Mean | 1.2197214 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.4654839 |
| Sum | 1030.6646 |
| Variance | 16.72006 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 722 | |
| 1.266972909 | 15 | 1.7% |
| 5 | 9 | 1.0% |
| 9 | 9 | 1.0% |
| 1 | 8 | 0.9% |
| 3 | 7 | 0.8% |
| 2 | 7 | 0.8% |
| 16 | 6 | 0.7% |
| 7 | 6 | 0.7% |
| 8 | 6 | 0.7% |
| Other values (20) | 50 | 5.8% |
| (Missing) | 13 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 722 | |
| 0.16 | 1 | 0.1% |
| 0.5 | 3 | 0.3% |
| 1 | 8 | 0.9% |
| 1.266972909 | 15 | 1.7% |
| 2 | 7 | 0.8% |
| 3 | 7 | 0.8% |
| 4 | 5 | 0.6% |
| 5 | 9 | 1.0% |
| 6 | 4 | 0.5% |
| Value | Count | Frequency (%) |
| 37 | 1 | 0.1% |
| 34 | 1 | 0.1% |
| 32 | 1 | 0.1% |
| 28 | 1 | 0.1% |
| 24 | 1 | 0.1% |
| 22 | 2 | |
| 21 | 1 | 0.1% |
| 20 | 1 | 0.1% |
| 19 | 3 | |
| 18 | 1 | 0.1% |
Smokes (packs/year)
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 62 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 13 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.45314395 |
| Minimum | 0 |
|---|---|
| Maximum | 37 |
| Zeros | 722 |
| Zeros (%) | 84.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2.48 |
| Maximum | 37 |
| Range | 37 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.2266098 |
|---|---|
| Coefficient of variation (CV) | 4.913692 |
| Kurtosis | 114.83971 |
| Mean | 0.45314395 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.3088062 |
| Sum | 382.90664 |
| Variance | 4.9577912 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 722 | |
| 0.5132021277 | 18 | 2.1% |
| 1 | 6 | 0.7% |
| 3 | 5 | 0.6% |
| 2 | 4 | 0.5% |
| 0.75 | 4 | 0.5% |
| 1.2 | 4 | 0.5% |
| 0.2 | 4 | 0.5% |
| 0.05 | 4 | 0.5% |
| 0.1 | 3 | 0.3% |
| Other values (52) | 71 | 8.3% |
| (Missing) | 13 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 722 | |
| 0.001 | 1 | 0.1% |
| 0.003 | 1 | 0.1% |
| 0.025 | 1 | 0.1% |
| 0.04 | 2 | 0.2% |
| 0.05 | 4 | 0.5% |
| 0.1 | 3 | 0.3% |
| 0.15 | 1 | 0.1% |
| 0.16 | 2 | 0.2% |
| 0.2 | 4 | 0.5% |
| Value | Count | Frequency (%) |
| 37 | 1 | 0.1% |
| 22 | 1 | 0.1% |
| 21 | 1 | 0.1% |
| 19 | 1 | 0.1% |
| 15 | 1 | 0.1% |
| 12 | 3 | |
| 9 | 2 | |
| 8 | 2 | |
| 7.6 | 1 | 0.1% |
| 7.5 | 1 | 0.1% |
Hormonal Contraceptives
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 108 |
| Missing (%) | 12.6% |
| Memory size | 6.8 KiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2250 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 481 | |
| 0.0 | 269 | |
| (Missing) | 108 | 12.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 481 | |
| 0.0 | 269 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1019 | |
| . | 750 | |
| 1 | 481 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2250 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1019 | |
| . | 750 | |
| 1 | 481 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2250 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1019 | |
| . | 750 | |
| 1 | 481 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2250 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1019 | |
| . | 750 | |
| 1 | 481 |
Hormonal Contraceptives (years)
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 40 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 108 |
| Missing (%) | 12.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2564192 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 269 |
| Zeros (%) | 31.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.5 |
| Q3 | 3 |
| 95-th percentile | 9.55 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.7642535 |
|---|---|
| Coefficient of variation (CV) | 1.6682421 |
| Kurtosis | 9.0433797 |
| Mean | 2.2564192 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 2.6264377 |
| Sum | 1692.3144 |
| Variance | 14.169605 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 269 | |
| 1 | 77 | 9.0% |
| 0.25 | 41 | 4.8% |
| 2 | 40 | 4.7% |
| 3 | 39 | 4.5% |
| 5 | 34 | 4.0% |
| 0.08 | 25 | 2.9% |
| 0.5 | 25 | 2.9% |
| 6 | 24 | 2.8% |
| 4 | 22 | 2.6% |
| Other values (30) | 154 | |
| (Missing) | 108 |
| Value | Count | Frequency (%) |
| 0 | 269 | |
| 0.08 | 25 | 2.9% |
| 0.16 | 16 | 1.9% |
| 0.17 | 1 | 0.1% |
| 0.25 | 41 | 4.8% |
| 0.33 | 9 | 1.0% |
| 0.41 | 1 | 0.1% |
| 0.42 | 8 | 0.9% |
| 0.5 | 25 | 2.9% |
| 0.58 | 6 | 0.7% |
| Value | Count | Frequency (%) |
| 30 | 1 | 0.1% |
| 22 | 1 | 0.1% |
| 20 | 4 | |
| 19 | 2 | 0.2% |
| 17 | 1 | 0.1% |
| 16 | 2 | 0.2% |
| 15 | 6 | |
| 14 | 2 | 0.2% |
| 13 | 2 | 0.2% |
| 12 | 4 |
IUD
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 117 |
| Missing (%) | 13.6% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2223 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 658 | |
| 1.0 | 83 | 9.7% |
| (Missing) | 117 | 13.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 658 | |
| 1.0 | 83 | 11.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1399 | |
| . | 741 | |
| 1 | 83 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2223 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1399 | |
| . | 741 | |
| 1 | 83 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2223 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1399 | |
| . | 741 | |
| 1 | 83 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2223 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1399 | |
| . | 741 | |
| 1 | 83 | 3.7% |
IUD (years)
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 26 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 117 |
| Missing (%) | 13.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.51480432 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 658 |
| Zeros (%) | 76.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 4 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.9430885 |
|---|---|
| Coefficient of variation (CV) | 3.7744216 |
| Kurtosis | 29.993328 |
| Mean | 0.51480432 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.0017585 |
| Sum | 381.47 |
| Variance | 3.7755931 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 658 | |
| 3 | 11 | 1.3% |
| 2 | 10 | 1.2% |
| 5 | 9 | 1.0% |
| 1 | 8 | 0.9% |
| 8 | 7 | 0.8% |
| 7 | 7 | 0.8% |
| 4 | 5 | 0.6% |
| 6 | 5 | 0.6% |
| 11 | 3 | 0.3% |
| Other values (16) | 18 | 2.1% |
| (Missing) | 117 | 13.6% |
| Value | Count | Frequency (%) |
| 0 | 658 | |
| 0.08 | 2 | 0.2% |
| 0.16 | 1 | 0.1% |
| 0.17 | 1 | 0.1% |
| 0.25 | 1 | 0.1% |
| 0.33 | 1 | 0.1% |
| 0.41 | 1 | 0.1% |
| 0.5 | 2 | 0.2% |
| 0.58 | 1 | 0.1% |
| 0.91 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 19 | 1 | 0.1% |
| 17 | 1 | 0.1% |
| 15 | 1 | 0.1% |
| 12 | 1 | 0.1% |
| 11 | 3 | |
| 10 | 1 | 0.1% |
| 9 | 1 | 0.1% |
| 8 | 7 | |
| 7 | 7 | |
| 6 | 5 |
STDs
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 674 | |
| 1.0 | 79 | 9.2% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 674 | |
| 1.0 | 79 | 10.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 1 | 79 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 1 | 79 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 1 | 79 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 1 | 79 | 3.5% |
STDs (number)
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 2.0 | 37 |
| 1.0 | 34 |
| 3.0 | 7 |
| 4.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 674 | |
| 2.0 | 37 | 4.3% |
| 1.0 | 34 | 4.0% |
| 3.0 | 7 | 0.8% |
| 4.0 | 1 | 0.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 674 | |
| 2.0 | 37 | 4.9% |
| 1.0 | 34 | 4.5% |
| 3.0 | 7 | 0.9% |
| 4.0 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 2 | 37 | 1.6% |
| 1 | 34 | 1.5% |
| 3 | 7 | 0.3% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 2 | 37 | 1.6% |
| 1 | 34 | 1.5% |
| 3 | 7 | 0.3% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 2 | 37 | 1.6% |
| 1 | 34 | 1.5% |
| 3 | 7 | 0.3% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1427 | |
| . | 753 | |
| 2 | 37 | 1.6% |
| 1 | 34 | 1.5% |
| 3 | 7 | 0.3% |
| 4 | 1 | < 0.1% |
STDs:condylomatosis
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 44 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 709 | |
| 1.0 | 44 | 5.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 709 | |
| 1.0 | 44 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1462 | |
| . | 753 | |
| 1 | 44 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1462 | |
| . | 753 | |
| 1 | 44 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1462 | |
| . | 753 | |
| 1 | 44 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1462 | |
| . | 753 | |
| 1 | 44 | 1.9% |
STDs:cervical condylomatosis
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 753 | |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 753 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
STDs:vaginal condylomatosis
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 4 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 749 | |
| 1.0 | 4 | 0.5% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 749 | |
| 1.0 | 4 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1502 | |
| . | 753 | |
| 1 | 4 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1502 | |
| . | 753 | |
| 1 | 4 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1502 | |
| . | 753 | |
| 1 | 4 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1502 | |
| . | 753 | |
| 1 | 4 | 0.2% |
STDs:vulvo-perineal condylomatosis
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 43 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 710 | |
| 1.0 | 43 | 5.0% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 710 | |
| 1.0 | 43 | 5.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1463 | |
| . | 753 | |
| 1 | 43 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1463 | |
| . | 753 | |
| 1 | 43 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1463 | |
| . | 753 | |
| 1 | 43 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1463 | |
| . | 753 | |
| 1 | 43 | 1.9% |
STDs:syphilis
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 18 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 735 | |
| 1.0 | 18 | 2.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 735 | |
| 1.0 | 18 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
STDs:pelvic inflammatory disease
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
STDs:genital herpes
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
STDs:molluscum contagiosum
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
STDs:AIDS
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 753 | |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 753 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1506 | |
| . | 753 |
STDs:HIV
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 18 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 735 | |
| 1.0 | 18 | 2.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 735 | |
| 1.0 | 18 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1488 | |
| . | 753 | |
| 1 | 18 | 0.8% |
STDs:Hepatitis B
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 752 | |
| 1.0 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1505 | |
| . | 753 | |
| 1 | 1 | < 0.1% |
STDs:HPV
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 105 |
| Missing (%) | 12.2% |
| Memory size | 6.8 KiB |
| 0.0 | |
|---|---|
| 1.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2259 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 751 | |
| 1.0 | 2 | 0.2% |
| (Missing) | 105 | 12.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 751 | |
| 1.0 | 2 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 753 | |
| 1 | 2 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 753 | |
| 1 | 2 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 753 | |
| 1 | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2259 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1504 | |
| . | 753 | |
| 1 | 2 | 0.1% |
STDs: Number of diagnosis
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 68 |
| 2 | 2 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 787 | |
| 1 | 68 | 7.9% |
| 2 | 2 | 0.2% |
| 3 | 1 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 787 | |
| 1 | 68 | 7.9% |
| 2 | 2 | 0.2% |
| 3 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 787 | |
| 1 | 68 | 7.9% |
| 2 | 2 | 0.2% |
| 3 | 1 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 787 | |
| 1 | 68 | 7.9% |
| 2 | 2 | 0.2% |
| 3 | 1 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 787 | |
| 1 | 68 | 7.9% |
| 2 | 2 | 0.2% |
| 3 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 787 | |
| 1 | 68 | 7.9% |
| 2 | 2 | 0.2% |
| 3 | 1 | 0.1% |
STDs: Time since first diagnosis
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 18 |
|---|---|
| Distinct (%) | 25.4% |
| Missing | 787 |
| Missing (%) | 91.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.1408451 |
| Minimum | 1 |
|---|---|
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 19 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.895024 |
|---|---|
| Coefficient of variation (CV) | 0.9599695 |
| Kurtosis | 0.68227866 |
| Mean | 6.1408451 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.3261791 |
| Sum | 436 |
| Variance | 34.751308 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 15 | 1.7% |
| 3 | 10 | 1.2% |
| 2 | 9 | 1.0% |
| 4 | 6 | 0.7% |
| 7 | 5 | 0.6% |
| 16 | 4 | 0.5% |
| 5 | 4 | 0.5% |
| 8 | 3 | 0.3% |
| 6 | 3 | 0.3% |
| 19 | 2 | 0.2% |
| Other values (8) | 10 | 1.2% |
| (Missing) | 787 |
| Value | Count | Frequency (%) |
| 1 | 15 | |
| 2 | 9 | |
| 3 | 10 | |
| 4 | 6 | 0.7% |
| 5 | 4 | 0.5% |
| 6 | 3 | 0.3% |
| 7 | 5 | 0.6% |
| 8 | 3 | 0.3% |
| 9 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 22 | 1 | 0.1% |
| 21 | 2 | |
| 19 | 2 | |
| 18 | 1 | 0.1% |
| 16 | 4 | |
| 15 | 1 | 0.1% |
| 12 | 1 | 0.1% |
| 11 | 2 | |
| 10 | 1 | 0.1% |
| 9 | 1 | 0.1% |
STDs: Time since last diagnosis
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 18 |
|---|---|
| Distinct (%) | 25.4% |
| Missing | 787 |
| Missing (%) | 91.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8169014 |
| Minimum | 1 |
|---|---|
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 7.5 |
| 95-th percentile | 18.5 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 5.5 |
Descriptive statistics
| Standard deviation | 5.7552705 |
|---|---|
| Coefficient of variation (CV) | 0.98940486 |
| Kurtosis | 1.0169533 |
| Mean | 5.8169014 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.4112042 |
| Sum | 413 |
| Variance | 33.123139 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 17 | 2.0% |
| 2 | 10 | 1.2% |
| 3 | 9 | 1.0% |
| 4 | 6 | 0.7% |
| 7 | 5 | 0.6% |
| 16 | 4 | 0.5% |
| 5 | 3 | 0.3% |
| 8 | 3 | 0.3% |
| 6 | 3 | 0.3% |
| 11 | 2 | 0.2% |
| Other values (8) | 9 | 1.0% |
| (Missing) | 787 |
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 2 | 10 | |
| 3 | 9 | |
| 4 | 6 | 0.7% |
| 5 | 3 | 0.3% |
| 6 | 3 | 0.3% |
| 7 | 5 | 0.6% |
| 8 | 3 | 0.3% |
| 9 | 1 | 0.1% |
| 10 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 22 | 1 | 0.1% |
| 21 | 2 | |
| 19 | 1 | 0.1% |
| 18 | 1 | 0.1% |
| 16 | 4 | |
| 15 | 1 | 0.1% |
| 12 | 1 | 0.1% |
| 11 | 2 | |
| 10 | 1 | 0.1% |
| 9 | 1 | 0.1% |
Dx:Cancer
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 18 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Dx:CIN
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 9 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 849 | |
| 1 | 9 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 849 | |
| 1 | 9 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 849 | |
| 1 | 9 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 849 | |
| 1 | 9 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 849 | |
| 1 | 9 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 849 | |
| 1 | 9 | 1.0% |
Dx:HPV
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 18 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 840 | |
| 1 | 18 | 2.1% |
Dx
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 24 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 834 | |
| 1 | 24 | 2.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 834 | |
| 1 | 24 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 834 | |
| 1 | 24 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 834 | |
| 1 | 24 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 834 | |
| 1 | 24 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 834 | |
| 1 | 24 | 2.8% |
Hinselmann
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 35 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 823 | |
| 1 | 35 | 4.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 823 | |
| 1 | 35 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 823 | |
| 1 | 35 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 823 | |
| 1 | 35 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 823 | |
| 1 | 35 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 823 | |
| 1 | 35 | 4.1% |
Schiller
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 74 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 784 | |
| 1 | 74 | 8.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 784 | |
| 1 | 74 | 8.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 784 | |
| 1 | 74 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 784 | |
| 1 | 74 | 8.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 784 | |
| 1 | 74 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 784 | |
| 1 | 74 | 8.6% |
Citology
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 44 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 814 | |
| 1 | 44 | 5.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 814 | |
| 1 | 44 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 814 | |
| 1 | 44 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 814 | |
| 1 | 44 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 814 | |
| 1 | 44 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 814 | |
| 1 | 44 | 5.1% |
Biopsy
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| 0 | |
|---|---|
| 1 | 55 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 858 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 803 | |
| 1 | 55 | 6.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 803 | |
| 1 | 55 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 803 | |
| 1 | 55 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 803 | |
| 1 | 55 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 803 | |
| 1 | 55 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 858 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 803 | |
| 1 | 55 | 6.4% |
| Age | Biopsy | Citology | Dx | Dx:CIN | Dx:Cancer | Dx:HPV | First sexual intercourse | Hinselmann | Hormonal Contraceptives | Hormonal Contraceptives (years) | IUD | IUD (years) | Num of pregnancies | Number of sexual partners | STDs | STDs (number) | STDs: Number of diagnosis | STDs: Time since first diagnosis | STDs: Time since last diagnosis | STDs:HIV | STDs:HPV | STDs:Hepatitis B | STDs:condylomatosis | STDs:genital herpes | STDs:molluscum contagiosum | STDs:pelvic inflammatory disease | STDs:syphilis | STDs:vaginal condylomatosis | STDs:vulvo-perineal condylomatosis | Schiller | Smokes | Smokes (packs/year) | Smokes (years) | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.056 | 0.000 | 0.200 | 0.334 | 0.083 | 0.075 | 0.439 | 0.000 | 0.192 | 0.263 | 0.281 | 0.289 | 0.525 | 0.214 | 0.000 | 0.000 | 0.000 | 0.429 | 0.518 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.107 | 0.028 | 0.057 | 0.064 |
| Biopsy | 0.056 | 1.000 | 0.315 | 0.139 | 0.083 | 0.140 | 0.140 | 0.068 | 0.535 | 0.000 | 0.191 | 0.026 | 0.090 | 0.090 | 0.000 | 0.094 | 0.085 | 0.102 | 0.000 | 0.000 | 0.104 | 0.000 | 0.000 | 0.066 | 0.049 | 0.000 | 0.000 | 0.000 | 0.000 | 0.069 | 0.724 | 0.000 | 0.106 | 0.049 |
| Citology | 0.000 | 0.315 | 1.000 | 0.064 | 0.000 | 0.089 | 0.089 | 0.000 | 0.176 | 0.000 | 0.174 | 0.000 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 | 0.037 | 0.000 | 0.000 | 0.045 | 0.000 | 0.000 | 0.038 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.040 | 0.351 | 0.000 | 0.129 | 0.000 |
| Dx | 0.200 | 0.139 | 0.064 | 1.000 | 0.572 | 0.640 | 0.591 | 0.042 | 0.042 | 0.000 | 0.000 | 0.134 | 0.145 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.702 | 0.702 | 0.000 | 0.055 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.079 | 0.048 | 0.000 | 0.000 |
| Dx:CIN | 0.334 | 0.083 | 0.000 | 0.572 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.015 | 0.000 | 0.023 | 0.000 | 0.000 | 0.000 | 0.000 | 0.940 | 0.940 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Dx:Cancer | 0.083 | 0.140 | 0.089 | 0.640 | 0.000 | 1.000 | 0.858 | 0.102 | 0.109 | 0.000 | 0.100 | 0.090 | 0.127 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.335 | 0.335 | 0.000 | 0.243 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.139 | 0.000 | 0.226 | 0.138 |
| Dx:HPV | 0.075 | 0.140 | 0.089 | 0.591 | 0.000 | 0.858 | 1.000 | 0.033 | 0.109 | 0.000 | 0.106 | 0.027 | 0.012 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.335 | 0.335 | 0.000 | 0.243 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.139 | 0.000 | 0.226 | 0.138 |
| First sexual intercourse | 0.439 | 0.068 | 0.000 | 0.042 | 0.000 | 0.102 | 0.033 | 1.000 | 0.031 | 0.092 | 0.080 | 0.000 | -0.018 | -0.020 | -0.122 | 0.000 | 0.009 | 0.000 | 0.099 | 0.135 | 0.000 | 0.000 | 0.000 | 0.035 | 0.000 | 0.000 | 0.000 | 0.060 | 0.196 | 0.037 | 0.000 | 0.104 | -0.137 | -0.133 |
| Hinselmann | 0.000 | 0.535 | 0.176 | 0.042 | 0.000 | 0.109 | 0.109 | 0.031 | 1.000 | 0.000 | 0.137 | 0.000 | 0.064 | 0.086 | 0.000 | 0.009 | 0.155 | 0.158 | 0.000 | 0.000 | 0.058 | 0.000 | 0.000 | 0.014 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 0.639 | 0.000 | 0.150 | 0.079 |
| Hormonal Contraceptives | 0.192 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.092 | 0.000 | 1.000 | 0.395 | 0.000 | 0.084 | 0.237 | 0.041 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.062 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.017 | 0.000 | 0.000 | 0.000 | 0.000 | 0.019 |
| Hormonal Contraceptives (years) | 0.263 | 0.191 | 0.174 | 0.000 | 0.000 | 0.100 | 0.106 | 0.080 | 0.137 | 0.395 | 1.000 | 0.155 | 0.052 | 0.280 | 0.067 | 0.000 | 0.000 | 0.000 | 0.192 | 0.246 | 0.000 | 0.112 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.060 | 0.000 | 0.000 | 0.158 | 0.064 | 0.046 | 0.047 |
| IUD | 0.281 | 0.026 | 0.000 | 0.134 | 0.015 | 0.090 | 0.027 | 0.000 | 0.000 | 0.000 | 0.155 | 1.000 | 0.852 | 0.245 | 0.000 | 0.032 | 0.079 | 0.000 | 0.239 | 0.271 | 0.000 | 0.000 | 0.000 | 0.059 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.039 | 0.069 | 0.034 | 0.000 | 0.078 |
| IUD (years) | 0.289 | 0.090 | 0.000 | 0.145 | 0.000 | 0.127 | 0.012 | -0.018 | 0.064 | 0.084 | 0.052 | 0.852 | 1.000 | 0.244 | 0.078 | 0.000 | 0.000 | 0.000 | 0.203 | 0.237 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.169 | 0.000 | -0.050 | -0.047 |
| Num of pregnancies | 0.525 | 0.090 | 0.000 | 0.000 | 0.023 | 0.000 | 0.000 | -0.020 | 0.086 | 0.237 | 0.280 | 0.245 | 0.244 | 1.000 | 0.170 | 0.066 | 0.045 | 0.000 | 0.313 | 0.372 | 0.000 | 0.000 | 0.000 | 0.075 | 0.000 | 0.034 | 0.000 | 0.211 | 0.000 | 0.065 | 0.116 | 0.098 | 0.059 | 0.061 |
| Number of sexual partners | 0.214 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | -0.122 | 0.000 | 0.041 | 0.067 | 0.000 | 0.078 | 0.170 | 1.000 | 0.087 | 0.057 | 0.000 | 0.265 | 0.304 | 0.000 | 0.000 | 0.000 | 0.147 | 0.000 | 0.042 | 0.042 | 0.000 | 0.000 | 0.149 | 0.000 | 0.185 | 0.248 | 0.244 |
| STDs | 0.000 | 0.094 | 0.021 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.009 | 0.000 | 0.000 | 0.032 | 0.000 | 0.066 | 0.087 | 1.000 | 0.998 | 0.941 | 1.000 | 1.000 | 0.442 | 0.102 | 0.030 | 0.718 | 0.030 | 0.030 | 0.030 | 0.442 | 0.180 | 0.709 | 0.093 | 0.115 | 0.148 | 0.149 |
| STDs (number) | 0.000 | 0.085 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.009 | 0.155 | 0.000 | 0.000 | 0.079 | 0.000 | 0.045 | 0.057 | 0.998 | 1.000 | 0.845 | 0.000 | 0.000 | 0.621 | 0.226 | 0.143 | 0.986 | 0.151 | 0.151 | 0.151 | 0.681 | 0.565 | 0.974 | 0.133 | 0.128 | 0.028 | 0.131 |
| STDs: Number of diagnosis | 0.000 | 0.102 | 0.037 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.158 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.941 | 0.845 | 1.000 | 0.000 | 0.000 | 0.575 | 0.038 | 0.097 | 0.703 | 0.097 | 0.097 | 0.097 | 0.440 | 0.223 | 0.693 | 0.146 | 0.103 | 0.054 | 0.054 |
| STDs: Time since first diagnosis | 0.429 | 0.000 | 0.000 | 0.702 | 0.940 | 0.335 | 0.335 | 0.099 | 0.000 | 0.000 | 0.192 | 0.239 | 0.203 | 0.313 | 0.265 | 1.000 | 0.000 | 0.000 | 1.000 | 0.925 | 0.155 | 0.335 | 0.445 | 0.163 | 0.000 | 0.000 | 0.445 | 0.066 | 0.000 | 0.142 | 0.213 | 0.000 | 0.151 | 0.144 |
| STDs: Time since last diagnosis | 0.518 | 0.000 | 0.000 | 0.702 | 0.940 | 0.335 | 0.335 | 0.135 | 0.000 | 0.000 | 0.246 | 0.271 | 0.237 | 0.372 | 0.304 | 1.000 | 0.000 | 0.000 | 0.925 | 1.000 | 0.000 | 0.335 | 0.445 | 0.144 | 0.000 | 0.000 | 0.445 | 0.000 | 0.000 | 0.109 | 0.222 | 0.000 | 0.168 | 0.161 |
| STDs:HIV | 0.000 | 0.104 | 0.045 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.058 | 0.062 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.442 | 0.621 | 0.575 | 0.155 | 0.000 | 1.000 | 0.000 | 0.108 | 0.083 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.085 | 0.104 | 0.038 | 0.173 | 0.216 |
| STDs:HPV | 0.000 | 0.000 | 0.000 | 0.055 | 0.000 | 0.243 | 0.243 | 0.000 | 0.000 | 0.000 | 0.112 | 0.000 | 0.000 | 0.000 | 0.000 | 0.102 | 0.226 | 0.038 | 0.335 | 0.335 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.093 |
| STDs:Hepatitis B | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.030 | 0.143 | 0.097 | 0.445 | 0.445 | 0.108 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.006 | 0.272 | 0.295 |
| STDs:condylomatosis | 0.000 | 0.066 | 0.038 | 0.000 | 0.000 | 0.000 | 0.000 | 0.035 | 0.014 | 0.000 | 0.000 | 0.059 | 0.000 | 0.075 | 0.147 | 0.718 | 0.986 | 0.703 | 0.163 | 0.144 | 0.083 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.252 | 0.976 | 0.093 | 0.044 | 0.066 | 0.024 |
| STDs:genital herpes | 0.000 | 0.049 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.030 | 0.151 | 0.097 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| STDs:molluscum contagiosum | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.034 | 0.042 | 0.030 | 0.151 | 0.097 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| STDs:pelvic inflammatory disease | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.042 | 0.030 | 0.151 | 0.097 | 0.445 | 0.445 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| STDs:syphilis | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.060 | 0.000 | 0.000 | 0.060 | 0.000 | 0.000 | 0.211 | 0.000 | 0.442 | 0.681 | 0.440 | 0.066 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.069 | 0.000 | 0.000 |
| STDs:vaginal condylomatosis | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.196 | 0.000 | 0.017 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.180 | 0.565 | 0.223 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.252 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.175 | 0.000 | 0.032 | 0.103 | 0.310 |
| STDs:vulvo-perineal condylomatosis | 0.000 | 0.069 | 0.040 | 0.000 | 0.000 | 0.000 | 0.000 | 0.037 | 0.018 | 0.000 | 0.000 | 0.039 | 0.000 | 0.065 | 0.149 | 0.709 | 0.974 | 0.693 | 0.142 | 0.109 | 0.085 | 0.000 | 0.000 | 0.976 | 0.000 | 0.000 | 0.000 | 0.000 | 0.175 | 1.000 | 0.097 | 0.048 | 0.069 | 0.034 |
| Schiller | 0.107 | 0.724 | 0.351 | 0.079 | 0.000 | 0.139 | 0.139 | 0.000 | 0.639 | 0.000 | 0.158 | 0.069 | 0.169 | 0.116 | 0.000 | 0.093 | 0.133 | 0.146 | 0.213 | 0.222 | 0.104 | 0.000 | 0.000 | 0.093 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.097 | 1.000 | 0.034 | 0.084 | 0.134 |
| Smokes | 0.028 | 0.000 | 0.000 | 0.048 | 0.000 | 0.000 | 0.000 | 0.104 | 0.000 | 0.000 | 0.064 | 0.034 | 0.000 | 0.098 | 0.185 | 0.115 | 0.128 | 0.103 | 0.000 | 0.000 | 0.038 | 0.000 | 0.006 | 0.044 | 0.000 | 0.000 | 0.000 | 0.069 | 0.032 | 0.048 | 0.034 | 1.000 | 0.441 | 0.788 |
| Smokes (packs/year) | 0.057 | 0.106 | 0.129 | 0.000 | 0.000 | 0.226 | 0.226 | -0.137 | 0.150 | 0.000 | 0.046 | 0.000 | -0.050 | 0.059 | 0.248 | 0.148 | 0.028 | 0.054 | 0.151 | 0.168 | 0.173 | 0.000 | 0.272 | 0.066 | 0.000 | 0.000 | 0.000 | 0.000 | 0.103 | 0.069 | 0.084 | 0.441 | 1.000 | 0.997 |
| Smokes (years) | 0.064 | 0.049 | 0.000 | 0.000 | 0.000 | 0.138 | 0.138 | -0.133 | 0.079 | 0.019 | 0.047 | 0.078 | -0.047 | 0.061 | 0.244 | 0.149 | 0.131 | 0.054 | 0.144 | 0.161 | 0.216 | 0.093 | 0.295 | 0.024 | 0.000 | 0.000 | 0.000 | 0.000 | 0.310 | 0.034 | 0.134 | 0.788 | 0.997 | 1.000 |
| Age | Number of sexual partners | First sexual intercourse | Num of pregnancies | Smokes | Smokes (years) | Smokes (packs/year) | Hormonal Contraceptives | Hormonal Contraceptives (years) | IUD | IUD (years) | STDs | STDs (number) | STDs:condylomatosis | STDs:cervical condylomatosis | STDs:vaginal condylomatosis | STDs:vulvo-perineal condylomatosis | STDs:syphilis | STDs:pelvic inflammatory disease | STDs:genital herpes | STDs:molluscum contagiosum | STDs:AIDS | STDs:HIV | STDs:Hepatitis B | STDs:HPV | STDs: Number of diagnosis | STDs: Time since first diagnosis | STDs: Time since last diagnosis | Dx:Cancer | Dx:CIN | Dx:HPV | Dx | Hinselmann | Schiller | Citology | Biopsy | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 18 | 4.0 | 15.0 | 1.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 15 | 1.0 | 14.0 | 1.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 34 | 1.0 | NaN | 1.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 52 | 5.0 | 16.0 | 4.0 | 1.0 | 37.000000 | 37.0 | 1.0 | 3.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 4 | 46 | 3.0 | 21.0 | 4.0 | 0.0 | 0.000000 | 0.0 | 1.0 | 15.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 42 | 3.0 | 23.0 | 2.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 | 51 | 3.0 | 17.0 | 6.0 | 1.0 | 34.000000 | 3.4 | 0.0 | 0.0 | 1.0 | 7.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 |
| 7 | 26 | 1.0 | 26.0 | 3.0 | 0.0 | 0.000000 | 0.0 | 1.0 | 2.0 | 1.0 | 7.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 45 | 1.0 | 20.0 | 5.0 | 0.0 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
| 9 | 44 | 3.0 | 15.0 | NaN | 1.0 | 1.266973 | 2.8 | 0.0 | 0.0 | NaN | NaN | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Age | Number of sexual partners | First sexual intercourse | Num of pregnancies | Smokes | Smokes (years) | Smokes (packs/year) | Hormonal Contraceptives | Hormonal Contraceptives (years) | IUD | IUD (years) | STDs | STDs (number) | STDs:condylomatosis | STDs:cervical condylomatosis | STDs:vaginal condylomatosis | STDs:vulvo-perineal condylomatosis | STDs:syphilis | STDs:pelvic inflammatory disease | STDs:genital herpes | STDs:molluscum contagiosum | STDs:AIDS | STDs:HIV | STDs:Hepatitis B | STDs:HPV | STDs: Number of diagnosis | STDs: Time since first diagnosis | STDs: Time since last diagnosis | Dx:Cancer | Dx:CIN | Dx:HPV | Dx | Hinselmann | Schiller | Citology | Biopsy | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 848 | 31 | 3.0 | 18.0 | 1.0 | 0.0 | 0.0 | 0.00 | 1.0 | 0.50 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 849 | 32 | 3.0 | 18.0 | 1.0 | 1.0 | 11.0 | 0.16 | 1.0 | 6.00 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0 | NaN | NaN | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 850 | 19 | 1.0 | 14.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 851 | 23 | 2.0 | 15.0 | 2.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 852 | 43 | 3.0 | 17.0 | 3.0 | 0.0 | 0.0 | 0.00 | 1.0 | 5.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 853 | 34 | 3.0 | 18.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 854 | 32 | 2.0 | 19.0 | 1.0 | 0.0 | 0.0 | 0.00 | 1.0 | 8.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 855 | 25 | 2.0 | 17.0 | 0.0 | 0.0 | 0.0 | 0.00 | 1.0 | 0.08 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 856 | 33 | 2.0 | 24.0 | 2.0 | 0.0 | 0.0 | 0.00 | 1.0 | 0.08 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 857 | 29 | 2.0 | 20.0 | 1.0 | 0.0 | 0.0 | 0.00 | 1.0 | 0.50 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Most frequently occurring
| Age | Number of sexual partners | First sexual intercourse | Num of pregnancies | Smokes | Smokes (years) | Smokes (packs/year) | Hormonal Contraceptives | Hormonal Contraceptives (years) | IUD | IUD (years) | STDs | STDs (number) | STDs:condylomatosis | STDs:cervical condylomatosis | STDs:vaginal condylomatosis | STDs:vulvo-perineal condylomatosis | STDs:syphilis | STDs:pelvic inflammatory disease | STDs:genital herpes | STDs:molluscum contagiosum | STDs:AIDS | STDs:HIV | STDs:Hepatitis B | STDs:HPV | STDs: Number of diagnosis | STDs: Time since first diagnosis | STDs: Time since last diagnosis | Dx:Cancer | Dx:CIN | Dx:HPV | Dx | Hinselmann | Schiller | Citology | Biopsy | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 15 | 1.0 | 14.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| 7 | 17 | 2.0 | 15.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| 1 | 15 | 1.0 | 15.0 | 1.0 | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 2 | 15 | 2.0 | 14.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 3 | 16 | 1.0 | 14.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 4 | 16 | 1.0 | 15.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 5 | 17 | 1.0 | 16.0 | 1.0 | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 6 | 17 | 1.0 | 17.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 8 | 17 | 2.0 | 15.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.33 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 9 | 18 | 1.0 | 14.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |